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Stereo 



SEQUENCE LISTING 
4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: BROW, MARY ANN D. 

7 LYAMICHEV, VICTOR I. 

8 OLIVE, DAVID M. 

10 (ii) TITLE OF INVENTION: RAPID DETECTION AND IDENTIFICATION OF 

11 PATHOGENS 
13 (iii) NUMBER OF SEQUENCES: 165 

15 (iv) CORRESPONDENCE ADDRESS: 

16 (A) ADDRESSEE: MEDLEN & CARROLL 

17 (B) STREET: 220 MONTGOMERY STREET, SUITE 2200 

18 (C) CITY: SAN FRANCISCO 

19 (D) STATE: CALIFORNIA 

20 (E) COUNTRY: UNITED STATES OF AMERICA 

21 (F) ZIP : 94104 

2 3 (v) COMPUTER READABLE FORM: 

24 (A) MEDIUM TYPE: Floppy disk 

2 5 (B) COMPUTER: IBM PC compatible 

26 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

27 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
29 (vi) CURRENT APPLICATION DATA: 

C--> 30 (A) APPLICATION NUMBER: US/09/940 , 925A 

C--> 31 • (B) FILING DATE: 10-Jun-2002 

32 (C) CLASSIFICATION: 

34 (viii) ATTORNEY/AGENT INFORMATION: 

35 (A) NAME: CARROLL, PETER G. 

36 (B) REGISTRATION NUMBER: 32,837 

37 (C) REFERENCE/DOCKET NUMBER: FORS-01756 

39 (ix) TELECOMMUNICATION INFORMATION: 

40 (A) TELEPHONE: (415) 705-8410 

41 (B) TELEFAX: (415) 397-8338 
44 (2) INFORMATION FOR SEQ ID NO: 1: 

4 6 (i) SEQUENCE CHARACTERISTICS: 

47 (A) LENGTH: 2506 base pairs 

48 (B) TYPE: nucleic acid 

49 (C) STRANDEDNESS : double 

50 (D) TOPOLOGY: linear 

52 (ii) MOLECULE TYPE: DNA (genomic) 

56 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : ; 

58 ATGAGGGGGA TGCTGCCCCT CTTTGAGCCC AAGGGCCGGG TCCTCCTGGT GGACGGCCAC 60 
60 CACCTGGCCT ACCGCACCTT CCACGCCCTG AAGGGCCTCA CCACCAGCCG GGGGGAGCCG 120 
62 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTCAAGGA GGACGGGGAC 180 
64 GCGGTGATCG TGGTCTTTGA CGCCAAGGCC CCCTCCTTCC GCCACGAGGC CTACGGGGGG 24 0 
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66 


TACAAGGCGG 


GCCGGGCCCC 


CACGCCGGAG 


GACTTTLCLC 


VjVjOAAV- 1 


t-v- 1 V^A 1 CAAvj 




68 


GAGCTGGTGG 


ACCTCCTGGG 


GCTGGCGCGC 


CTCGAGGTCC 


CvjCjGC 1 ACAj A 


rTTTT* A /^"/^ TV 

Vj CO Vj A Cvj A L. 


jDU 


70 


GTCCTGGCCA 


GCCTGGCCAA 


GAAGGCGGAA 


AAGGAGGGCT 




CA 1 LL 1 LACL 


a o n 

4zU 


72 


GCCGACAAAG 


ACCTTTACCA 


GCTCCTTTCC 


GACCGCATCC 


ACG TCCTCCA 




** OU 


74 


TACCTCATCA 


CCCCGGCCTG 


GCTTTGGGAA 


AAGTACGGCC 








76 


GACTACCGGG 


CCCTGACCGG 


GGACGAGTCC 


GACAACCTTC 


CCGGGGTCAA 


GGGCATCGGG 


£ r\ t\ 

600 


78 


GAGAAGACGG 


CGAGGAAGCT 


TCTGGAGGAG 


TGGGGGAGCC 


TGGAAGCCCT 


CCTCAAGAAC 


660 


80 


CTGGACCGGC 


TGAAGCCCGC 


CATCCGGGAG 


AAGATCCTGG 


CCCACATGGA 


CGATCTGAAG 


720 


82 


CTCTCCTGGG 


ACCTGGCCAA 


GGTGCGCACC 


GACCTGCCCC 


TGGAGGTGGA 


CTTCGCCAAA 


780 


84 


AGGCGGGAGC 


CCGACCGGGA 


GAGGCTTAGG 


GCCTTTCTGG 


AGAGGCTTGA 


GTTTGGCAGC 


840 


86 


CTCCTCCACG 


AGTTCGGCCT 


TCTGGAAAGC 


CCCAAGGCCC 


TGGAGGAGGC 


CCCCTGGCCC 


900 


88 


CCGCCGGAAG 


GGGCCTTCGT 


GGGCTTTGTG 


CTTTCCCGCA 


AGGAGCCCAT 


GTGGGCCGAT 


960 


90 


CTTCTGGCCC 


TGGCCGCCGC 


CAGGGGGGGC 


CGGGTCCACC 


GGGCCCCCGA 


GCCTTATAAA 


1020 


92 


GCCCTCAGGG 


ACCTGAAGGA 


GGCGCGGGGG 


CTTCTCGCCA 


AAGACCTGAG 


CGTTCTGGCC 


1080 


94 


CTGAGGGAAG 


GCCTTGGCCT 


CCCGCCCGGC 


GACGACCCCA 


TGCTCCTCGC 


CTACCTCCTG 


1140 


96 


GACCCTTCCA 


ACACCACCCC 


CGAGGGGGTG 


GCCCGGCGCT 


ACGGCGGGGA 


GTGGACGGAG 


1200 


98 


GAGGCGGGGG 


AGCGGGCCGC 


CCTTTCCGAG 


AGGCTCTTCG 


CCAACCTGTG 


GGGGAGGCTT 


1260 



100 GAGGGGGAGG AGAGGCTCCT TTGGCTTTAC CGGGAGGTGG AGAGGCCCCT TTCCGCTGTC 1320 

102 CTGGCCCACA TGGAGGCCAC GGGGGTGCGC CTGGACGTGG CCTATCTCAG GGCCTTGTCC 1380 

104 CTGGAGGTGG CCGAGGAGAT CGCCCGCCTC GAGGCCGAGG TCTTCCGCCT GGCCGGCCAC 144 0 

106 CCCTTCAACC TCAACTCCCG GGACCAGCTG GAAAGGGTCC TCTTTGACGA GCTAGGGCTT 1500 

108 CCCGCCATCG GCAAGACGGA GAAGACCGGC AAGCGCTCCA CCAGCGCCGC CGTCCTGGAG 1560 

110 GCCCTCCGCG AGGCCCACCC CATCGTGGAG AAGATCCTGC AGTACCGGGA GCTCACCAAG 1620 

112 CTGAAGAGCA CCTACATTGA CCCCTTGCCG GACCTCATCC ACCCCAGGAC GGGCCGCCTC 1680 

114 CACACCCGCT TCAACCAGAC GGCCACGGCC ACGGGCAGGC TAAGTAGCTC CGATCCCAAC 1740 

116 CTCCAGAACA TCCCCGTCCG CACCCCGCTT GGGCAGAGGA TCCGCCGGGC CTTCATCGCC 1800 

118 GAGGAGGGGT GGCTATTGGT GGCCCTGGAC TATAGCCAGA TAGAGCTCAG GGTGCTGGCC 1860 

12 0 CACCTCTCCG GCGACGAGAA CCTGATCCGG GTCTTCCAGG AGGGGCGGGA CATCCACACG 1920 

122 GAGACCGCCA GCTGGATGTT CGGCGTCCCC CGGGAGGCCG TGGACCCCCT GATGCGCCGG 1980 

124 GCGGCCAAGA CCATCAACTT CGGGGTCCTC TACGGCATGT CGGCCCACCG CCTCTCCCAG 2040 

126 GAGCTAGCCA TCCCTTACGA GGAGGCCCAG GCCTTCATTG AGCGCTACTT TCAGAGCTTC 2100 

128 CCCAAGGTGC GGGCCTGGAT TGAGAAGACC CTGGAGGAGG GCAGGAGGCG GGGGTACGTG 2160 

130 GAGACCCTCT TCGGCCGCCG CCGCTACGTG CCAGACCTAG AGGCCCGGGT GAAGAGCGTG 2220 

132 CGGGAGGCGG CCGAGCGCAT GGCCTTCAAC ATGCCCGTCC AGGGCACCGC CGCCGACCTC 2280 

134 ATGAAGCTGG CTATGGTGAA GCTCTTCCCC AGGCTGGAGG AAATGGGGGC CAGGATGCTC 2340 

136 CTTCAGGTCC ACGACGAGCT GGTCCTCGAG GCCCCAAAAG AGAGGGCGGA GGCCGTGGCC 2400 

138 CGGCTGGCCA AGGAGGTCAT GG AGGGGGTG TATCCCCTGG CCGTGCCCCT GGAGGTGGAG 2460 

14 0 GTGGGGATAG GGGAGGACTG GCTCTCCGCC AAGGAGTGAT ACCACC 2506 
142 (2) INFORMATION FOR SEQ ID NO: 2: 



144 (i) SEQUENCE CHARACTERISTICS: 

145 (A) LENGTH: 2496 base pairs 

146 (B) TYPE: nucleic acid 
14 7 (C) STRANDEDNESS : double 
148 (D) TOPOLOGY: linear 

150 (ii) MOLECULE TYPE: DNA (genomic) 

154 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 



156 ATGGCGATGC TTCCCCTCTT TGAGCCCAAA GGCCGCGTGC TCCTGGTGGA CGGCCACCAC 60 
158 CTGGCCTACC GCACCTTCTT TGCCCTCAAG GGCCTCACCA CCAGCCGCGG CGAACCCGTT 120 
160 CAGGCGGTCT ACGGCTTCGC CAAAAGCCTC CTCAAGGCCC TGAAGGAGGA CGGGGACGTG 180 
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162 GTGGTGGTGG TCTTTGACGC CAAGGCCCCC TCCTTCCGCC ACGAGGCCTA CGAGGCCTAC 24 0 

164 AAGGCGGGCC GGGCCCCCAC CCCGGAGGAC TTTCCCCGGC AGCTGGCCCT CATCAAGGAG 300 

166 TTGGTGGACC TCCTAGGCCT TGTGCGGCTG GAGGTTCCCG GCTTTGAGGC GG ACGACGTG 360 

168 CTGGCCACCC TGGCCAAGCG GGCGGAAAAG GAGGGGTACG AGGTGCGCAT CCTCACTGCC 4 20 

170 GACCGCGACC TCTACCAGCT CCTTTCGGAG CGCATCGCCA TCCTCCACCC TGAGGGGTAC 480 

172 CTGATCACCC CGGCGTGGCT TTACGAGAAG TACGGCCTGC GCCCGGAGCA GTGGGTGGAC 54 0 

174 TACCGGGCCC TGGCGGGGGA CCCCTCGGAT AACATCCCCG GGGTG AAGGG CATCGGGGAG 600 

176 AAGACCGCCC AGAGGCTCAT CCGCGAGTGG GGGAGCCTGG AAAACCTCTT CCAGCACCTG 660 

178 GACCAGGTGA AGCCCTCCTT GCGGGAGAAG CTCCAGGCGG GCATGGAGGC CCTGGCCCTT 720 

180 TCCCGGAAGC TTTCCCAGGT GCACACTGAC CTGCCCCTGG AGGTGGACTT CGGGAGGCGC 780 

182 CGCACACCCA ACCTGGAGGG TCTGCGGGCT TTTTTGGAGC GGTTGGAGTT TGGAAGCCTC 84 0 

184 CTCCACGAGT TCGGCCTCCT GGAGGGGCCG AAGGCGGCAG AGGAGGCCCC CTGGCCCCCT 900 

186 CCGGAAGGGG CTTTTTTGGG CTTTTCCTTT TCCCGTCCCG AGCCCATGTG GGCCGAGCTT 960 

188 CTGGCCCTGG CTGGGGCGTG GGAGGGGCGC CTCCATCGGG CACAAGACCC CCTTAGGGGC 1020 

190 CTGAGGGACC TTAAGGGGGT GCGGGGAATC CTGGCCAAGG ACCTGGCGGT TTTGGCCCTG 1080 

192 CGGGAGGGCC TGGACCTCTT CCCAGAGGAC GACCCCATGC TCCTGGCCTA CCTTCTGGAC 114 0 

194 CCCTCCAACA CCACCCCTGA GGGGGTGGCC CGGCGTTACG GGGGGGAGTG GACGGAGGAT 1200 

196 GCGGGGGAGA GGGCCCTCCT GGCCGAGCGC CTCTTCCAGA CCCTAAAGGA GCGCCTTAAG 1260 

198 GGAGAAGAAC GCCTGCTTTG GCTTTACGAG GAGGTGGAGA AGCCGCTTTC CCGGGTGTTG 1320 

200 GCCCGGATGG AGGCCACGGG GGTCCGGCTG GACGTGGCCT ACCTCCAGGC CCTCTCCCTG 1380 

202 GAGGTGGAGG CGGAGGTGCG CCAGCTGGAG GAGGAGGTCT TCCGCCTGGC CGGCCACCCC 144 0 

204 TTCAACCTCA ACTCCCGCGA CCAGCTGGAG CGGGTGCTCT TTGACGAGCT GGGCCTGCCT 1500 

206 GCCATCGGCA AGACGGAGAA GACGGGGAAA CGCTCCACCA GCGCTGCCGT GCTGGAGGCC 1560 

208 CTGCGAGAGG CCCACCCCAT CGTGGACCGC ATCCTGCAGT ACCGGGAGCT CACCAAGCTC 1620 

210 AAGAACACCT ACATAGACCC CCTGCCCGCC CTGGTCCACC CCAAGACCGG CCGGCTCCAC 1680 

212 ACCCGCTTCA ACCAGACGGC CACCGCCACG GGCAGGCTTT CCAGCTCCGA CCCCAACCTG 174 0 

214 CAGAACATCC CCGTGCGCAC CCCTCTGGGC CAGCGCATCC GCCGAGCCTT CGTGGCCGAG 1800 

216 GAGGGCTGGG TGCTGGTGGT CTTGGACTAC AGCCAGATTG AGCTTCGGGT CCTGGCCCAC 1860 

218 CTCTCCGGGG ACGAGAACCT GATCCGGGTC TTTCAGGAGG GGAGGGACAT CCACACCCAG 1920 

220 ACCGCCAGCT GG ATGTTCGG CGTTTCCCCC GAAGGGGTAG ACCCTCTGAT GCGCCGGGCG 1980 

222 GCCAAGACCA TCAACTTCGG GGTGCTCTAC GGCATGTCCG CCCACCGCCT CTCCGGGGAG 204 0 

224 CTTTCCATCC CCTACGAGGA GGCGGTGGCC TTCATTGAGC GCTACTTCCA GAGCTACCCC 2100 

226 AAGGTGCGGG CCTGGATTGA GGGGACCCTC GAGGAGGGCC GCCGGCGGGG GTATGTGGAG 2160 

228 ACCCTCTTCG GCCGCCGGCG CTATGTGCCC GACCTCAACG CCCGGGTGAA GAGCGTGCGC 2220 

230 GAGGCGGCGG AGCGCATGGC CTTCAACATG CCGGTCCAGG GCACCGCCGC CGACCTCATG 2280 

232 AAGCTGGCCA TGGTGCGGCT TTTCCCCCGG CTTCAGGAAC TGGGGGCGAG GATGCTTTTG 2340 

234 CAGGTGCACG ACGAGCTGGT CCTCGAGGCC CCCAAGGACC GGGCGGAGAG GGTAGCCGCT 24 00 

236 TTGGCCAAGG AGGTCATGGA GGGGGTCTGG CCCCTGCAGG TGCCCCTGGA GGTGGAGGTG 24 60 

238 GGCCTGGGGG AGGACTGGCT CTCCGCCAAG GAG TAG 24 96 
240 (2) INFORMATION FOR SEQ ID NO : 3: 

242 (i) SEQUENCE CHARACTERISTICS: 

243 (A) LENGTH: 2504 base pairs 

244 (B) TYPE: nucleic acid 
24 5 (C) STRANDEDNESS : double 
246 (D) TOPOLOGY: linear 

248 (ii) MOLECULE TYPE: DNA (genomic) 

252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 3: 

254 ATGGAGGCGA TGCTTCCGCT CTTTGAACCC AAAGGCCGGG TCCTCCTGGT GGACGGCCAC 60 

256 CACCTGGCCT ACCGCACCTT CTTCGCCCTG AAGGGCCTCA CCACGAGCCG GGGCGAACCG 120 
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258 GTGCAGGCGG TCTACGGCTT CGCCAAGAGC CTCCTCAAGG CCCTGAAGGA GGACGGGTAC 180 

260 AAGGCCGTCT TCGTGGTCTT TGACGCCAAG GCCCCCTCCT TCCGCCACGA GGCCTACGAG 240 

262 GCCTACAAGG CGGGGAGGGC CCCGACCCCC GAGGACTTCC CCCGGCAGCT CGCCCTCATC 300 

264 AAGGAGCTGG TGGACCTCCT GGGGTTTACC CGCCTCGAGG TCCCCGGCTA CGAGGCGGAC 360 

266 GACGTTCTCG CCACCCTGGC CAAGAAGGCG GAAAAGGAGG GGTACGAGGT GCGCATCCTC 4 20 

268 ACCGCCGACC GCGACCTCTA CCAACTCGTC TCCGACCGCG TCGCCGTCCT CCACCCCGAG 4 80 

270 GGCCACCTCA TCACCCCGGA GTGGCTTTGG GAGAAGTACG GCCTCAGGCC GGAGCAGTGG 540 

272 GTGGACTTCC GCGCCCTCGT GGGGGACCCC TCCGACAACC TCCCCGGGGT CAAGGGCATC 6 00 

274 GGGGAGAAGA CCGCCCTCAA GCTCCTCAAG GAGTGGGGAA GCCTGGAAAA CCTCCTCAAG 6 60 

276 AACCTGGACC GGGTAAAGCC AGAAAACGTC CGGGAGAAGA TCAAGGCCCA CCTGGAAGAC 720 

278 CTCAGGCTCT CCTTGGAGCT CTCCCGGGTG CGCACCGACC TCCCCCTGGA GGTGGACCTC 780 

280 GCCCAGGGGC GGGAGCCCGA CCGGGAGGGG CTTAGGGCCT TCCTGGAGAG GCTGGAGTTC 84 0 

282 GGCAGCCTCC TCCACGAGTT CGGCCTCCTG GAGGCCCCCG CCCCCCTGGA GGAGGCCCCC 900 

284 TGGCCCCCGC CGGAAGGGGC CTTCGTGGGC TTCGTCCTCT CCCGCCCCGA GCCCATGTGG 960 

286 GCGGAGCTTA AAGCCCTGGC CGCCTGCAGG GACGGCCGGG TGCACCGGGC AGCAGACCCC 1020 

288 TTGGCGGGGC TAAAGGACCT CAAGGAGGTC CGGGGCCTCC TCGCCAAGGA CCTCGCCGTC 1080 

2 90 TTGGCCTCGA GGGAGGGGCT AGACCTCGTG CCCGGGGACG ACCCCATGCT CCTCGCCTAC 114 0 

292 CTCCTGGACC CCTCCAACAC CACCCCCGAG GGGGTGGCGC GGCGCTACGG GGGGGAGTGG 1200 

294 ACGGAGGACG CCGCCCACCG GGCCCTCCTC TCGGAGAGGC TCCATCGGAA CCTCCTTAAG 1260 

296 CGCCTCGAGG GGGAGGAGAA GCTCCTTTGG CTCTACCACG AGGTGGAAAA GCCCCTCTCC 1320 

298 CGGGTCCTGG CCCACATGGA GGCCACCGGG GTACGGCTGG ACGTGGCCTA CCTTCAGGCC 1380 

300 CTTTCCCTGG AGCTTGCGGA GGAGATCCGC CGCCTCGAGG AGGAGGTCTT CCGCTTGGCG 1440 

302 GGCCACCCCT TCAACCTCAA CTCCCGGGAC CAGCTGGAAA GGGTGCTCTT TGACGAGCTT 1500 

304 AGGCTTCCCG CCTTGGGGAA GACGCAAAAG ACAGGCAAGC GCTCCACCAG CGCCGCGGTG 156 0 

306 CTGGAGGCCC TACGGGAGGC CCACCCCATC GTGGAGAAGA TCCTCCAGCA CCGGGAGCTC 1620 

308 ACCAAGCTCA AGAACACCTA CGTGGACCCC CTCCCAAGCC TCGTCCACCC GAGGACGGGC 1680 

310 CGCCTCCACA CCCGCTTCAA CCAGACGGCC ACGGCCACGG GGAGGCTTAG TAGCTCCGAC 1740 

312 CCCAACCTGC AGAACATCCC CGTCCGCACC CCCTTGGGCC AGAGGATCCG CCGGGCCTTC 1800 

314 GTGGCCGAGG CGGGTTGGGC GTTGGTGGCC CTGGACTATA GCCAGATAGA GCTCCGCGTC 1860 

316 CTCGCCCACC TCTCCGGGGA CGAAAACCTG ATCAGGGTCT TCCAGGAGGG GAAGGACATC 1920 

318 CACACCCAGA CCGCAAGCTG GATGTTCGGC GTCCCCCCGG AGGCCGTGGA CCCCCTGATG 1980 

320 CGCCGGGCGG CCAAGACGGT GAACTTCGGC GTCCTCTACG GCATGTCCGC CCATAGGCTC 204 0 

322 TCCCAGGAGC TTGCCATCCC CTACGAGGAG GCGGTGGCCT TTATAGAGGC TACTTCCAAA 2100 

324 GCTTCCCCAA GGTGCGGGCC TGGATAGAAA AGACCCTGGA GGAGGGGAGG AAGCGGGGCT 2160 

326 ACGTGGAAAC CCTCTTCGGA AGAAGGCGCT ACGTGCCCGA CCTCAACGCC CGGGTGAAGA 2220 

328 GCGTCAGGGA GGCCGCGGAG CGCATGGCCT TCAACATGCC CGTCCAGGGC ACCGCCGCCG 2280 

330 ACCTCATGAA GCTCGCCATG GTGAAGCTCT TCCCCCGCCT CCGGGAGATG GGGGCCCGCA 2 34 0 

332 TGCTCCTCCA GGTCCACGAC GAGCTCCTCC TGGAGGCCCC CCAAGCGCGG GCCGAGGAGG 2400 

334 TGGCGGCTTT GGCCAAGGAG GCCATGGAGA AGGCCTATCC CCTCGCCGTG CCCCTGGAGG 24 60 

336 TGGAGGTGGG GATGGGGGAG GACTGGCTTT CCGCCAAGGG TTAG 2 504 
338 (2) INFORMATION FOR SEQ ID NO : 4: 



340 (i) SEQUENCE CHARACTERISTICS: 

341 (A) LENGTH: 832 amino acids 

342 (B) TYPE: amino acid 

34 3 (C) STRANDEDNESS : single 

344 (D) TOPOLOGY: linear 

346 (ii) MOLECULE TYPE: protein 

350 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

352 Met Arg Gly Met Leu Pro Leu Phe Glu Pro Lys Gly Arg Val Leu Leu 
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Lys 
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